Finding Good Conversations Online: The Yahoo News Annotated Comments Corpus
Abstract
This work presents a dataset and annotation scheme for the new task of identifying “good” conversations that occur online, which we call ERICs: Engaging, Respectful, and/or Informative Conversations. We develop a taxonomy to reflect features of entire threads and individual comments which we believe contribute to identifying ERICs; code a novel dataset of Yahoo News comment threads (2.4k threads and 10k comments) and 1k threads from the Internet Argument Corpus; and analyze the features characteristic of ERICs. This is one of the largest annotated corpora of online human dialogues, with the most detailed set of annotations. It will be valuable for identifying ERICs and other aspects of argumentation, dialogue, and discourse.
Similar Resources
Using New York Times Picks to Identify Constructive Comments
We examine the extent to which we are able to automatically identify constructive online comments. We build several classifiers using New York Times Picks as positive examples and non-constructive thread comments from the Yahoo News Annotated Comments Corpus as negative examples of constructive online comments. We evaluate these classifiers on a crowd-annotated corpus containing 1,121 comments. ...
The SENSEI Annotated Corpus: Human Summaries of Reader Comment Conversations in On-line News
Researchers are beginning to explore how to generate summaries of extended argumentative conversations in social media, such as those found in reader comments in on-line news. To date, however, there has been little discussion of what these summaries should be like and a lack of human-authored exemplars, quite likely because writing summaries of this kind of interchange is so difficult. In this ...
A Study of Readers' Demographic and Behavioral Patterns for Selective Dissemination of News
Purpose: The current research focuses on selective dissemination of news and aims to find patterns for recognizing readers' favorite news through web mining techniques. Method: Data for this research were collected from the Yahoo News website. The source of the news was the Associated Press. 840 news items dated between 2011/3/1 and 2011/5/10 were analyzed through a subject clustering technique. Findings:...
Automatically Identifying Good Conversations Online (Yes, They Do Exist!)
Online news platforms curate high-quality content for their readers and, in many cases, users can post comments in response. While comment threads routinely contain unproductive banter, insults, or users “shouting” over each other, there are often good discussions buried among the noise. In this paper, we define a new task of identifying “good” conversations, which we call ERICs—Engaging, Respe...
Multilevel Annotation of Agreement and Disagreement in Italian News Blogs
In this paper, we present a corpus of news blog conversations in Italian annotated with gold standard agreement/disagreement relations (ADRs) at message and sentence levels. This is the first resource of this kind in Italian. The analysis of ADRs at the two levels shows that agreement annotated at message level is consistent and generally reflected at sentence level, and that the structure of d...